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Abstract 

I explore several related routes to deriving the Jordan-algebraic structure of finite- 
. dimensional quantum theory from more transparent operational and physical prin- 

^ I ciples, mainly involving ideas about the symmetries of, and the correlations between, 

probabilistic models. The key tool is the Koecher-Vinberg Theorem, which identifies 
. formally real Jordan algebras with finite-dimensional order-unit spaces having ho- 

mogeneous, self-dual cones. 

o 

m 

^ ■ Introduction 

'■^ I These notes pull together some ideas for motivating the Jordan-algebraic structure of 

J3 ' finite-dimensional quantum theory from principles having a more obvious operational or 

probabilistic meaning. The key tool is the Koecher-Vingerg theorem, wich lets us identify 
formally real Jordan algebras with finite-dimensional order-unit spaces with homoge- 
^ I neous, self-dual cones. The strategy is to motivate homogeneity and self-duality of the 

I cone of "effects" associated with a general probabilistic model, in terms of independently 

O . meaningful (and, ideally, plausible) principles. 
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Rather than offering a single set of axioms from which this structure can cleanly be 
derived, I explore in some detail the consequences of various assumptions, mainly to do 
with the symmetries of a system, and with the possibility of correlating this system with 
a canonical "conjugate" system. Afterwards, I observe that several different axiomatic 
packages can be extracted from the results of this study, any of which will enforce the 
^ I homogeneity and self-duality of the cone generated by a system's basic measurement 

; outcomeslll 

In a bit more detail, a finite dimensional probabilistic model specifies a set of basic mea- 
surements, a (compact) convex set of states — understood as probability weights on mea- 
surement outcomes — and a compact group of symmetries under which the both the set 
of measurements and the set of states are invariant. Any such model A gives rise, in a 
canonical manner, to an order-unit space E(A), in which the positive cone is generated by 
the model's measurement outcomes. Any normalized, positive linear functional on E(A) 



^This is in accord with a prejudice of mine, namely, that quantum theory (at any rate, its probabilistic 
framework) does not have a single, stark physical meaning, but is more analogous to, say, the class of 
normal probability distributions, which arise in many different contexts for many different reasons — but 
which can be characterized in ways that lead us to expect this ubiquity. It also reflects the conviction that 
the various conditions considered here, and the structures that they constrain, are of independent interest, 
and merit a systematic study. 
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gives rise, by restriction, to a probability weight on measurement outcomes. I call the 
model state-complete if its state space contains every such weight. 

Call a model bi-symmetric iff the group of symmetries acts transitively on pairs of distinct 
measurement outcomes, and on pure states. Where the state space is irreducible, bi- 
symmetry implies the existence of at most one G-invariant bilinear form on E(A) that 
is positive on E(A)+ and simultaneously orthogonalizes distinct measurement outcomes. 
Moreover, if it exists, this form is an inner product. If the model is also state-complete, 
it follows that the self-duality of the cone is equivalent to a condition called sharpness: 
every measurement outcome has probability one in a unique state. 

It remains to secure the existence of an orthogonalizing invariant, positive form on E(A). 
I suggest three (related) ways of doing so. One is to postulate the existence, for every 
system A, of a conjugate system A, canonically isomorphic to A, and a bipartite non- 
signaling state between A and its conjugate in which every measurement is perfectly, 
and uniformly, correlated with its image in A. This state (analogous to the Bell state 
in quantum mechanics) then gives rise to the required bilinear form on E(A). Another 
approach is to require the existence of a bi-symmetric composite of two copies of A, and 
an invariant state in which the two component systems are independent. A third is to 
ask that all systems under consideration be representable as a set of objects in a dagger- 
monoidal category. 

This work builds upon the earlier papers [SI El IHl EHl I30II . In particular, it echoes, but 
improves upon, the last of these. While I have included enough detail to make this paper 
reasonably self-contained, I do assume the reader has at least a glancing familiarity with 
the lingo of ordered vector spaces and convex cones, and more or less remembers what a 
Jordan algebra is. The book [13] by Faraut and Koranyi contains an excellent introduc- 
tion to homogeneous self-dual cones and Jordan algebras, and includes a very accessible 
proof of the Koecher-Vinberg Theorem. See also t2J for a recent survey of this material 
with particular reference to quantum theory. 



1 Order-Unit Spaces and Probabilistic Models 

Let me begin by fixing some notation and terminology, recalling along the way some 
basic facts about ordered vector spaces. First, a convention: absent any statement to 
the contrary, all vector spaces considered here are finite dimensional. The dual space of 
a (finite-dimensional) vector space V is denoted V* ; the space of linear transformations 
V ^ W is denoted by ^(V, W), with L(V) abbreviating L(V, V). 

By a cone in a real vector space V, I will always mean a closed, convex, pointed, generating 
cone K — that is, a topologically closed convex set if c V, closed under multiplication by 
non-negative scalars, satisfying K n -K = {0}, and spanning V (whence, V = K - K). An 
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ordered vector space is a real vector space V with a distinguished cone if =: E+ This 
determines a translation-invariant partial order, given by a < 6 iff 6 - a £ V+ ; thus, V+ = 
{a G V|a > 0}. The standard or pointwise cone of V = [R^ is the cone of non-negative 
functions. The standard cone in the space S^(H.) of Hermitian operators on a (real or 
complex) Hilbert space H consists of such operators of the form aa* . 

If V and W are two ordered vector spaces, a linear mapping (/) : V ^ W is positive iff 
(p(V+) Q W+. Note that this is a cone. If is a linear isomorphism and is positive 
■ — equivalently, if (f>(V+) = W+ — then (p is an order-isomorphism between V and W. An 
order-isomorphism V - V is an order automorphism of V. The dual cone V+ is the set of 
positive linear functionals f eV* . 

An order unit on V is a positive functional u e V* that is strictly positive, i.e, uix) = for 
X £ V+ only if X = 0. This is equivalent (in finite dimensions, anyway), to the condition 
that, if f £ V* , then f <nu for some ;z £ N. More generally, an order unit in an ordered 
vector space E is an element u £ E+ such that, for every a £ E+, there exists neN with a < 
nu. An order-unit space is a pair (E,u) where E is an ordered vector space and a £ E+ is 
an order unit. Order unit spaces arise very naturally (as we'll see below) as probabilistic 
models. One defines an effect to be an element a £ E+ with a<u; a discrete observable on 
E is a set {at} of effects with Y.iO-i- u. A state on E is a positive functional a £ E* with 
a(u) = 1, so that, for any observable, the mapping ai ^ aiat) defines a probability weight 
on every observable {at}. We speak of aiat) as the probability of at occuring when the 
observable {a J is measured in state a. 

As an illustration, if H is a finite-dimensional complex Hilbert space, let L(H) denote the 
space of Hermitian operators on H, ordered by the cone of positive operators (operators 
of the form aa*). Then {^(H.),Tr) is an order-unit space, in which the observables are 
exactly the discrete "POVMs" representing quantum observables, and the states are the 
linear functionals a '—^ Tr(pa) corresponding to density operators p on H. 

The order unit space L(H) has two very striking geometric properties: 

Definition 1 (Self-Duality and Homogeneity): An order-unit space E is self-dual iff 
there exists an inner product (, ) on E such that 

E+=E+:={a£E I <a,&)>0 V6£E+ }. 

An order-unit space E is homogeneous iff the group of order-automorphisms (invertible 
positive mappings with positive inverses) E ^ E acts transitively on the interior of E+. 

Beyond ^(H), the cone of (Jordan) squares in any formally real Jordan algebra is homo- 
geneous and self-dual. Remarkably, this is the only example! 

Theorem (Koecher [18|, Vinberg ^25]) Let E be an homogeneous, self-dual (HSD) 
order-unit space. Then there exists a unique formally real Jordan product on E, with 
respect to which E+ is the cone of squares, and u is the identity. 
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Both homogeneity and self-duality seem a bit more transparent to physical (or opera- 
tional, or probabilistic) intuition, than does the Jordan product. So it's reasonable to try 
to motivate these two constraints independently. Homogeneity seems to present the eas- 
ier challenge. Indeed, if we view order-automorphisms of E as representing reversible 
physical processes on the corresponding system, then the homogeneity of the "state cone" 
E* simply requires that every interior state be reversibly transformable into any other by 
some physical process. Of course, the adjective "interior" is annoying here. In an earlier 
paper [7] with Howard Barnum and Philipp Gaebler, it is shown that homogeneity also 
follows from the assumption that every state on E is the marginal of a bipartite "steering" 
state. This condition also makes the state cone weakly self-dual, that is, isomorphic to its 
dual cone. However, strict self-duality requires this isomorphism to be mediated by an 
inner product, and this has proved trickier to motivate. 

A different approach, explored in [I30l|, is to derive the homogeneity and self-duality of 
the "effect cone" E+ from ideas about the symmetries of systems, and the possibility of 
correlating two copies of a system. In order to achieve this, I made use of an ad-hoc "min- 
imization" axiom (which I'll review below). Here, I aim to do better, and, in particular, to 
avoid this assumption. 

1.1 Test spaces and probabilistic models 

For my purposes, the abstract order-unit spaces dealt with above are a little too abstract. 

Definition 2 (Test spaces): A test space is a pair (X, 21) where X is a set of outcomes and 
21 is a covering of X by non-empty sets called tests, interpreted as the sets of mutually 
exclusive outcomes associated with various experiments. A prohablity weight on (X,2l) 
is a function a : X ^ [0, 1] with Y.xeE '^(^) = 1 for every £J e 21. I'll write n(X, 21) for the 
convex set of all probability weights on (X,2l)Jl 

The rank of a test space (X,2l) is the least upper bound of \E\ where £J e 21. For purposes 
of this note, all test spaces have finite rank. In particular, all tests are finte sets. It 
follows easily that the set n(X,2l) of all probability weights on (X,2l) is a closed, and 
hence, compact, subset of [0, 1]^. 

Notation: Anticipating later results, I'll write x A- y io mean that outcomes x,y e X are 
distinguishable by means of a test in 21 — that is, that y and there exists some £J g 21 
with x,y ^E. Note that, at present, there is no linear structure in view, let alone an inner 
product, so the notation is only suggestive. 

^Mathematically, a test space is just a hypergraph. The terminology is meant to enforce a particular 
interpretation. Test spaces — originally termed "manuals" — were the basis for a generalized probability 
theory (and an associated "empirical logic") developed in the 1970s and 80s by C. H. Randall and D. J. 
Foulis and their students. See Il29]| for a survey. It is important to understand that 21 is not necessarily 
intended as the complete catalogue of all possible measurements on a given system, but only some set of 
measurements sufficient to capture the system's states, which we have singled out for some reason (perhaps 
one of tradition, or of exegetical efficiency). 
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Definition 3 (Symmetry): By a symmetry of a test space (X,2l), I mean a bijection 
g :X ^ X that permutes elements of 21. 

Notice that if ^ is a symmetry, then for a\\ x, y e X , x ± y iff gx ±gy. An action of a group 
G on (X,2l) is an action by symmetries, and a test space equipped with such an action is 
a G-test space. I'll write Aut(X,2l) for the group of all symmetries of In the cases 

that will interest us, this will always be isomorphic to a compact subgroup of GL(d) for a 
sufficiently large finite dimension d . 

In constructing a model for a probabilistic system, we may want to privilege not only 
the "observables" represented by the tests E £ ^2i, but also certain states and certain 
symmetries. This suggests the following 

Definition 4 (Probabilistic Models): A probabilistic model — or, for purposes of this 
note, just a model — is a structure (X,2l,0,G), where (X,2l) is a (finite-rank) test space, 
CI is a separating, pointwise-closed (hence, pointwise compact) convex set of probability 
weights on (X,2l), and G is a compact group of symmetries of (X,2l) leaving Q invariant. 

I'll call D. the state space of the model; probability weights aeO. are states. Where O has 
finite affine dimension, I'll say that the model is finite-dimensional. All models considered 
in this paper are finite-dimensional in this sense. In the interest of sanity, I'll hereafter 
denote models by Roman capital letters A,B,..., writing (for instance) A = (X,2l,C2,G). 
It will often be convenient to label the components with the name of the model, as, e.g., 
(Z(A),2l(A),Q(A),G(A)). I will use the terms "model" and "system" interchangeably. 

It's time to look at some examples. 

Example 1: Classical models Let E he a single, classical outcome-set (say, for a coin- 
flip, or rolling a die). Let X = E, ^ = {E}, and G < SiE) be any group you like of permu- 
tations of E. Let r be any separating, permutation-invariant set of probability weights 
on E, and let Q be the closed convex hull of P. Alternatively, choose any separting closed 
convex set O of probability weights, and let G be the group of permutations leaving D. 
invariant. 

Example 2: Quantum Models Let H be an n-dimensional complex Hilbert space. The 
corresponding quantum model is A(H) := (X(H), 21(H), Q(H),i7(H)), where 

• X(H) is the set of rank-one projection operators on H, 

• 21(H) is the set of maximal pairwise orthogonal sets of such projections, 

• Q(H) is the set of states of the form x Tr(px), p a density operator on and 

• U(H.) is the group of unitary operators on H, acting on X by conjugation. 

'^Gleason's Theorem tells us that Q(H) = Q(X(H),2l(H)) for dim(H) > 2; for dim(H) = 2, the density 
matrices need to be put in by hand. 
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Example 3: The Square Bit For a much different, and much simpler, example, consider 
a test space (X,2l) consisting of two disjoint, two-outcome tests — say, X = {a,a',b,b'} 
and 21 = {{a,a'},{b,b'}}. Then the space Q of all probability weights on (X,2l) is affinely 
isomorphic to the unit square in IR^. The square bit is the model (X,2l,n,G) where G is 
the dihedral group acting on D. in the obvious way, and dually on (X,2l). 

Example 4: Jordan Models Let E be a formally real Jordan algebra, and let X denote 
the set of primitive idempotents in E. A Jordan frame is a pairwise orthogonal set of 
idempotents summing to the order unit. Letting O. denote the set of states on E and G, 
the set of Jordan automorphisms of E, we have a Jordan model (X,2l,n,G). 

1.2 Models Linearized 

Let A be a model. Every outcome x e X(A) determines an affine functional x : f2(A) — IR 
by evaluation: x(a) = a(x). Letting Af^n(A)) denote the space of all real-valued affine 
functionals on D(A), we have then a natural — and, clearly, G-equivariant — mapping 
X(A) — ► Aff(n(A)). It is largely harmless to assume that this is injective, i.e., that 0.(A) 
separates points of X. (If not, replace (X (A), 21(A)) by the obvious quotient structure.) 
From now on, I assume this is the case; that is, I make it a standing assumption that all 
probabilistic models have separating sets of states. 

In view of this, it is convenient to identify x e XiA) with the corresponding functional, 
so that X Q AS{Q.{A)). I also assume, from this point on, that all models are finite di- 
mensional, in the sense that QiA) has finite affine dimension. It follows that Aff(n) is 
a finite-dimensional real vector space. Let E = E(A) denote the span of X in AffCQ), 
ordered by the cone consisting of linear combinations of outcomes having non-negative 
coefficients: 

E+ = {Y,tiXi \xi€X, ti>0}. 

i 

Note that this may be smaller than the cone { a g E | a{a) > Va £ D. } inherited from 
Aff(n)+, and that, unlike the latter, it depends on the choice of X. Notice, too, that the 
action of G on X extends uniquely to a linear action on E, given by (ga)ia) - a{a o g) for 
all a e E and all a e D, and that E+ is stable under this action. Finally, observe that, for 
every E efH, LxeB^ = u, where u is the unit functional u{a) = 1 for all a e D. This last 
serves as an order-unit for E. 

I'll call the order-unit space (E(A), u) the linear hull of the model A. Notice that every test 
E e 21(A) can now be regarded as a discrete observable on E(A). Thus, we can, to a large 
extent, regard a probabilistic model as an order-unit space equipped with a distinguished 
collection of observables (sufficient to separate points), invariant under a distinguished 
compact group of order-automorphisms, and with a distinguished convex set D. of states 
(of which, more in a moment). 

Examples: In the case of a quantum model A = A(H), E(A) can be identified with the 
order-unit space if (H) of Hermitian operators on H, ordered by the usual cone, with u 
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the identity operator. In the case of the square bit, E(A) is isomorphic to U^, equipped 
with a cone having a square cross-section. In the case of a Jordan model, E(A) is canoni- 
cally isomorphic, as an order-unit space, to the given Jordan algebra. 

1.3 Sharpness and State-Completeness 

If (E, u) is any order-unit space, a state on E is a positive linear functional p : E ^ [R that 
is normalized so that p{u) - 1. If A = iX,f2l,D,,G) is a model, with linear hull E(A), then 
any state a e D. defines a state on E(A), just by evaluation: aia) := aia). Conversely, a 
state p on E defines a state on the test space (X,2l) by restriction. In general, however, 
this will not lie in the designated state space n(A) of the model. 

Let D.iA) denote the set of all states on (X,2l) arising from states on E(A). Obviously, 

c Q. We may regard D. as the set of probability weights that are consistent with all 
of the linear relations among outcomes that are satisfied by the given state space Q(A). 
Evidently, the assignment Q i-^ Q is a closure on the poset of closed convex subsets of the 
(full) state space of (X,2l). Let's agree to call a model state-complete iff Q = Q. In this 
case, E+ coincides with the cone E n Aff(n(A))+, i.e, every element of E+ taking positive 
values on D. belongs to E+. (This last condition is called saturation in (BJ). 

Example 5: For an example of a non-state complete model, let E = {x,y} be a single 
two-outcome classical outcome-set, and consider the probability weights pi,P2 given by 
Pi(x) = .6,pi(y) = .4, and P2ix) = .4,p2iy) = .6. The set {pi,P2} is invariant under the 
obvious action of G = S2 = SiE), and separates x and y. Let A = iE,{E},D.,S2) where D, 
is the closed convex hull of pi and p2, i.e., Q - {tpi + (1 - t)p2\0 <t< 1.}. Then E(A) 
with E+ the first quadrant. The full state space Cl consists of all probability weights on 
E, and is thus considerably larger than n0 

All of the non-classical models discussed above are state-complete. State-completeness 
is a pretty reasonable condition to impose on a probabilistic model, at least in a finite- 
dimensional setting, and it will figure as a crucial hypothesis in many of the results below. 
Nevertheless, in order to keep clearly in view what does and what does not depend on it, 

1 make no standing assumption of state-completeness. To help in keeping this in mind, 
I'll use the notation V(A) for the space E(A)*, ordered not by the natural dual cone, but 
by the cone V+(A) generated by the designated state space n(A). State-completeness 
amounts to the condition that V(A) = E(A)* (in which case, we also have E(A) V(A)* . 

Another condition that will play a significant role in what follows is sharpness: 

Definition 5 (Sharpness): A model A = (X,2l,n,G) is sharp iff, for every xeX, there 
exists a unique state Sx^^ with dxix) = 1. 

In the earlier papers l|29[|30ll . I called a model sharp iff, for every outcome x e XiA), there 
'^Thanks to Jon Barrett for pointing out this sort of simple example. 
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exists a unique state a e E*(A) with a(x) = 1. If A is state-complete (as was tacitly as- 
sumed in I128B ). this coincides with the oresent notion. Sharpness (in one form or another) 
has a long history in the quantum-logical literature. In particular, it played a central role 
in Gunson's axiomatics for quantum theory [ I4l. A stronger form of sharpness, in which 
it is also required that each pure state render certain a unique outcome, is used by Hardy 
inlHSll. 

1.4 Morphisms of Models 

At several points I'm going to need to treat models categorically. Thre are various notions 
of morphism one might use, but the one that makes the most sense in the current context 
seems to be the following. 

Definition 6 (Morphisms): A morphism from a model A to a model fi is a pair i(f>,y/), 
where 

(i) is a mapping X(A) X{B), pushing tests of A forward to tests of B, and pulling 
states of B back to states on A — that is, 

0(2l(A)) c 21(5) and (p*{D.{B)) c n(A) 

(where (/)*(iS) = iSo(/).) 

(ii) VGHom(G(A),G(5)); 

(iii) (pigx) = ^f{g)(p{x) for all x e XiA),g e G{B). 

In practice, it will be convenient to regard y/ as defining an action of G(A) on X{B), writing 
gy for \f/{g)y for g g G(A) and y g X(B). When I wish to suppress explicit mention of y/ 
in this way, I'll simply write (p for the pair i(f>,y/). From this point of view, (iii) says that 
<p is equivariant. (Note, though, that the given action of G(A) on XiB) must be through 
elements of G(B).) 

An isomorphism of models is an invertible morphism; equivalently, a bijective mapping 
(p : XiA) XiB), equivariant with respect to an action of G(A) on XiB) (by members of 
G(B)), taking 21(A) bijectively onto 2l(fi), and inducing an affine isomorphism <p* : D.iB) 
n(A). In particular, every symmetry of a model A is a morphism from A to itself 

A morphism (p : A ^ B lifts naturally to a positive linear mapping between the corre- 
sponding linear hulls. To spell this out, notice that the affine mapping (p* : D,(B) —- QiB) 
guaranteed by condition (i) of the definition, induces a linear map (p** : Aff(n(A)) —>■ 
AmniB)), given by i(p**a)ip) = ai(p*ip)) = a(;6o0). Identifying x e XiA) with the cor- 
responding vector x e E(A) < Aff(n(A)), and similarly taking (jyix) e E(B) < Aff(n(S)), it 
follows that (f>**{x) = (pix). Thus, (p** restricts to a linear mapping (p : E(A) E(fi) ex- 
tending (p : XiA) —- XiB). Since this takes outcomes to outcomes, it sends E+(A) into 
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E+iB), that is, is positive. Note, too, that if £ g 21(A) and F = (p(E) e 21(B), we have 
(piuA) - T.xeE(p(x) - T-yeFy - ub- Thus, we Can regard A ^ E(A) as the object part of a 
functor from probabihstic models and morphisms, to order-unit spaces and positive, unit- 
preserving Hnear maps. This observation will be put to use in due course. 



2 Bi-Symmetric Models 

I now wish to impose some constraints on the models under consideration. This section 
spells out some consequences of a package of symmetry assumptions which, taken to- 
gether, assert that (i) all pure states, all outcomes, and all tests tests look the same, and 
(ii) individual tests have no (or little) internal structure, in the sense that the outcomes 
of any test can be permuted more or less freely by symmetries of the model, keeping the 
test fixed. 

Definition 7a (Full symmetry): A test space (X,2C) is fully symmetric under the action 
of a group G iff (i) every test £J g 21 has the same cardinality, and (ii) every bijection 
f :E ^ F, E,F e^, is implemented by some element g^G, i.e., gx = fix) for every xeE. 

See 11271 [251 for more on this notion. Full symmetry entails that G act transitively on 
both 21 and X. 

Example 6: Let E be a formally real Jordan algebra, and let X denote the set of primitive 
(that is, atomic) idempotents in E. Let 21 be the collection of all finite subsets of X 
summing to the unit element of E, and let D. be the set of all p g E+ with {p,u} = 1 
where ( , > is the canonical inner product on E. Finally, let G be the group of all Jordan 
automorphisms of E. Then iX,f2l,D.,G) is a probabilistic model. Moreover, it is fully 
symmetric (031, Theorem IV.2.5). 

A weaker condition than full symmetry, still sufficient for most of what follows, is that 
G act transitively on 21 and on the set of orthogonal (that is, distinguishable) pairs of 
outcomes: 

Definition 7b (2-Symmetry): (X,2l) is 2-symmetric under the action of G iff (i) G acts 
transitively on 21, and (ii) G acts transitively on pairs of distinguishable measurement 
outcomes, that is, for all outcomes x,y,u,v e X with x ± y and u ±v, then there exists 
some g £ G such that gx - u,gy - v. Note that any fully-symmetric test space is also 
2-transitive. 

In the context, not of a test space, but of a probabilistic model, I am also going to ask that 
G act transitively on the set of pure states. Thus, 

Definition 8 (Bi-symmetry): A model (X,2l,n,G) is fully bi-symmetric, respectively bi- 
symmetric, iff (i) G acts fully symmetrically, resp., 2-symmetrically, on (X,2l), and (ii) G 
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acts transitively on extreme points of Q. 



Bi-symmetric models can readily be constructed "by hand", as follows C^SI. Suppose E is 
a set, thought of as the outcome-set of a "standard test", and suppose if is a group acting 
2-transitively on E. Let G be any group with G > H, and let if < G be any subgroup of 
G with K nH = , the stabilizer in H of some point Xg^E. Set X = G/K, and embed E 
in X via hxg hK, where h cH. (The condition that K nH = H^^ guarantees that this 
is well-defined). Let 21 be the orbit ofE in ^(X) under G, that is, 21 = {gE\g e G}. Then 
G acts 2-symmetrically on (X,2l). Now choose any So g n(X,2l), and set Q be the closed 
convex hull of G6o- See [27ll28l for more on this construction. The possibility of freely 
construcing bi-symmetric models in this way means, on the one hand, that bi-symmetry 
is a reasonably benign assumption, but also that it is not a very constraining one. 

Remark: Individually, state-transitivity and state-completeness are very reasonable ax- 
ioms: the former asks that we construct our state space in a natural way (as just out- 
lined); the latter asks that we enlarge our state space, if necessary, in an equally natural 
way. However, there is a tension between these reasonable requirements, in that enlarg- 
ing the state space to secure state-completeness may spoil state-transitivity. We can only 
rarely satisfy both conditions at onceo 

2.1 SPIN forms 

Until further notice, A = (X,2l,0,G) is a bi-symmetric model of rank n, and E - E(A). 

Definition 9 (SPIN forms): Let ^ : E x E ^ IR be a bilinear form. I will say that SS 
is positive iff SSia,b) > for all a,b e E+, normalized iff B{u,u) = 1, and invariant iff 
SSiga,gb) - B(a,b) for all g ^ G. I'll call a symmetric positive, invariant, normalized 
bilinear bilinar form on E a SPIN form for shortH . 

There is a more or less canonical example, namely, the inner product 



where Sg is any pure state (that is, extreme point) in D. and the integration is with respect 
to normalized Haar measure. Owing to the transitivity of G on the set of pure states, this 
is independent of the choice of dg 

We can also define a degenerate SPIN form SSq, defined by 

SSoia,b)= {a,u)G{b,u)G 

for all a,& £ E. This turns out to be independent of the choice of the SPIN inner product 
(indeed, of the SPIN form) appearing on the right. 

^This should not be too dismaying: a set of axioms must be in some tension with one another if they are 
to single out a narrow class of models. 

^Of course, this is an absolutely dreadful choice of terminology; but I can't seem to think of anything 
better at the moment. Suggestions? 
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Any SPIN form on E is associated with two non-negative real constants: 

(1) := BB{x,x) for all xeX, and 

(2) c : = SSix,y) for all x,y £X with x ±y. 

Call these the parameters ofSS (though, as we'll now see, they are not independent of one 
another). 

Lemma 1: Let be a symmetric, positive, invariant, normalized bilinear form on E. 
Then the parameters r and c satisfy 

(a) S§ix, u) = 1/n for all x £ X; 

(b) r^ + in-l)c = l/n 

(c) r^<l/n. 

(d) Let m and M denote, respectively, the minimum and maximum values ofSSonXxX. 
Then m < 1/n^ < M. 

(e) If S& is positive-semidefinite, r^ > lln'^ > c. 

(f) If is an inner product and r^ = 1/n^, then E is one-dimensional. 

Proof: For (a), note that SS(x,u) is a constant, again by transitivity of G on X, whence, 
n^ix,u) = Y.xeE^^^^u) = SS(u,u) = 1. For (b) note that if x g £J g 21, we have S§ix,u) - 
HyeE^i^^y) - SS{x,x) + Y.yeE\{x] SS{x,y) -r'^ + {n- l)c, which gives the desired inequality. 
Since > 0, this also yields (c), as < + (n - 1)cb- For (d), note that for any E e 21, we 
have n^m < T.x,yeE SS{x,y) = S§iu, u) = 1; similarly, n^M > SS{u, u) = 1. 

For (e), observe that if is positive semi-definite, then ||i;|| := y/^iv,v) is a semi-norm, 
with \\x\\ = r for every x£X. Hence, by the triangle inequality, we have 

l = ||j/|| = || ^x||< ^ \\x\\=nr, 

xeE xeE 

whence, > 1/n^. It now follows from (b) that 

in - l)c = 1/n -r^< 1/n - 1/n^ = 

giving us c < 1/n^. FinalUy, for (f), suppose BB is an inner product. If r^ = 1/n, then it 
follows from (b), as above, that (n - l)c = whence, c = 1/n^ as well. Hence, for any 
X -L y in X, we have 

1 2 1 

— = c = S§ix,y) = \\x\\ WyW cosd = r cosd - —cosd, 
n^ n 
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whence, the angle 6 between x and y is 0, i.e., x-y. It follows that (X,2l) has rank n-1, 
whence, dimE = 1. 

Finally, if is positive semi-definite, then by (b) and (c), we also have 

in - l)c < 1/n -r^< 1/n - lln^ = 

giving us (g). □ 

Corollary 1: If S^i and ^2 o-re SPIN forms on E, then for all a e IS,, S^x{a,u) = (a, u). 

Proof: xeX, then ^i(x, u) = S§2(x, u) = 1/n, by Lemma 1. Since X spans E, the result 
follows from the bilinearity of S§i and S§2- D 

A consequence is that the degenerate form SSo{a,b) := SS(a,u)SSib,u) is independent of 
the choice of SS. Henceforth, I refer to this SPIN form as the uniform SPIN form. 

2.2 Minimizing and Orthogonalizing Forms 

If ^ is a SPIN bilinear form on E and x £ X, we can define a probability weight on 
(X,2l) by a^iy) nSSix,y) for all y £ X. There is, however, no guarantee that this state 
will belong to the designated state space D.. 

Since G is compact, and acts continuously on E, its orbits are also compact. In particular, 
X is compact. It follows that every bilinear form — in particular, every SPIN bilinear 
form — achieves a maximimum and a minimum value on X x X. 

Definition 10 (Minimizing and Orthogonalizing SPIN forms): A SPIN bilinear 
form ^ on E is minimizing iff S§{x,y) achieves its minimum value on X x X at a point 
ix,y) with X ±y. is orthogonalizing iff c^g = 0, i.e., SSix,y) = for all x,y eX with x ±y. 

Clearly, orthogonalizing implies minimizing. In the language of this paper. Proposition 1 
of II30II asserts that if A is bi-symmetric, sharp, state-complete model, and E(A) admits a 
minimizing form, then E(A)+ is self-dual. It is also shown that, under these assumptions, 
E(A) has an orthogonalizing form. One of the main goals of the present paper is to find 
sufficient conditions for such a minimizing form to exist. 

The existence of an orthogonalizing form has many consequences. For one thing, if S§ 
is orthogonalizing, then for every x e X, the probability weight 6x '■= nSS{x,-) assigns 
probability 1 to x and to any outcome y Lx. 

If ^ is a SPIN form on E, let's agree to write E"*" for the set of vectors a £ E such that 
S^ia, 6) > for all b £ E+ (even if ^ is not an inner product). The positivity of guaran- 
tees that E+ c E+. If E+ = E+, I'll say that E is self-dual with respect to SS. The following 
is essentially proposition 1 from [|30ll . but formulated more generally, for SPIN forms 
rather than SPIN inner products: 
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Lemma 2: Let Abe a sharp, state-complete, hi-symmetric model. IfSSisa non-degenerate 
orthogonalizing SPIN form on E(A), then E(A) is self-dual with respect to S^. 

Proof: We have E+ c E"*" in any case. Let xeX. Since A is state-complete, the probability 
weight 5x '■- nSSix, •) belongs to C Since B is orthogonalizing, 6xix) - 1. Since the model 
is sharp and state-complete, 5x is the unique state with this property, and hence, pure. 
Since the model is state-complete, bi-symmetry guarantees that any pure state e: on E has 
the form gS^ = nSS(g~^x, •) for some g Every extremal vector i; e E"*" with SSiv, u)-l 
corresponds to a unique pure state Cu via Cy — S^iv , •). Since i3S is non-degenerate, it follows 
that V - ng~^x for some geG. But then i; g E+. It follows that E+ c E+. □ 

Thus, if there exists an orthogonalizing SPIN inner product on E(A) (A sharp, state- 
complete, and bi-symmetric) then E(A) is self-dual. In [30], the existence of a minimizing 
SPIN inner product was simply postulated. Most of the remainder of this paper is devoted 
to finding reasonable sufficient conditions for the existence of such an inner product. 

2.3 Irreducible Systems 

By the Corollary to Lemma 1, the ortho-complement u-^ = {x^ E|S(x,a) = 0} is indepen- 
dent of the SPIN form B. I'll say that the model A irreducible in case u-^ has no non- 
trivial G(A)-invariant subspace|j Things work especially nicely when A is irreducible in 
this sense. 

Lemma 3: Let A be irreducible, and suppose is any particular non-degenerate sym- 
metric, invariant, normalized bilinear form on E. Then all SPIN forms on E have the 
form 

SBxia, b) A^(a, 6) + (1 - A)^(a, u)mu, b) 
for some real parameter X. 

By the remark following Corollary 1, regardless of the choice of S§, we have SSg the uni- 
form SPIN form, in conformity with our earlier usage. 

Proof: Let S§' be any SPIN bilinear form. Since is non-degenerate, we have an operator 
)6 : E ^ E, self-adjoint with respect to SS, such that SS'ia,b) = S^{f5a,b) for all a,& g E. 
Since is G-invariant, ^ is G-equivariant, i.e., piga) = g^ia) for all a g E and all g^G. 

Let a-*- be the orthocomplement of u with respect to (,). By Corollary 1, 

u^ = {a e'E\m'{a, u) = Q} = {a g E|^(a, u) = 0}, 

so W"*- is invariant under ^. Let j6o denote the restriction of P to w^, noting that this is 
still self-adjoint and G-equivariant. In particular, has a real eigenvalue A, and the 
eigenspace Vx of Po is an invariant subspace of u"*". Since the latter is irreducible, and 

^Admitting that this is again lousy terminology. 
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Va 7^ (by the non-degeneracy of SS), Vx = Thus, for all ao,bo^ u^, we have 

If a,b are now arbitrary vectors in E, we can write a = Uo + ai and b = bo + bi, where 
ao,bo G M"*" and ai = S§{a,u)u, bi = SS{b,u)u. By Corollary 1, we have SS'{ao,u) = SSiao,u) - 
0, so that SB'{ao,bi} = Mb,u)SS'iao,u) = 0; likewise, SB'{ai,bo) = 0. Thus, 

SS'ia,b) = ^'iao + ai,bo + bi) = SSiao,bo) + Mai,bi) = Xm'iao,bo) + ^ia,u)Bib,u) 

Since Uq = a-B{a,u)u and bo = b - SSib,u)u, we can also write this as 

SB'ia, b) = X{S§(a, b) - 2S§(a, u)SS{b, u) + SSia, u)S§(b,u))) + S§(a, u)SS{b, u). 

Simplifying, this gives us S§'ia,b) = XS§{a, 6) + (1 - X)SS{a, u)SS{b, u), as promised. □ 

Suppose now that cx - SSx(x,y) for orthogonal x,y, and let c = SSix,y) where S§ is a chosen 
SPIN inner product (say, the standard one arising from group averaging). Then 

9 9 n^c-1 1 
cx = A(c - + = A 5— + 

which is iff 

1 ( n"" \ 1 
A = ^ 



n^ 



(Notice that 1 - n^c = only for c -lln^, which is to say, only = S^o, which we have 
ruled out by taking B to be an inner product, so this value of X is legitimate.) It follows 
that there is at most one orthogonalizing SPIN form on E, this corresponding to a non- 
negative value of A. In order to guarantee that such a form exists, we need to know 
something more about the positivity of the forms As above, let 5§ be any chosen 
SPIN inner product on E; as in Lemma 1, let m denote the minimum value of B{x,y) as 
x,y range over X, and recall that, by part (d) of Lemma 1, this never exceeds lln'^. 

Lemma 4: Let A be irreducible, with dimE(A) > 1. Let denote any particular SPIN 
inner product on E (say, the standard one arising from group averaging) on E, and let m 
and M denote, respectively, the minimum and maximum values of on X xX. Then, for 
any A £ IR, 

(a) The minimum value mx ofSSx is given by 

I Am + (l-A)/n2 = X(m - 1/n^) + 1/n^ if X>0 



mx = < 



XM + (l-X)/n^ = X{M-l/n^) + l/n^ if X<0 



(1) 



(b) SSx is positive iff 



<A< 



1-Mn'^ 1-mn'^ 
where m and M are, respectively, the minimum and maximum values of on XxX. 



^This is simply the real form of Schur's Lemma. 
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(c) 3Bx is positive-semidefinite iff X>0, and an inner product iffX>0. 

Proof: (a) If A is non-negative, the minimum of XSBix,y) + {1- X)/n^ occurs where is 
minimized; if A < 0, the minimum occurs where is maximized, (b) From part (a), we 
see that for A > 0, mx is non-negative iff A(m - 1/n^) > -1/n^, or, equivalently, (recalhng 
that m < 1/n^, so that m - 1/n^ is negative) 



nHm - lln^) 1 - mn^ 

Similarly, if A < 0, mi > iff A > 1/(1 -M/i^). (c) Now suppose that SSx(.a,a) < 0. Then 
A||a||^ + (1 - X)^ia,u)^ < 0. Since u is normalized, and ^ is an inner product, we have 
S8(a,u)'^ < \\a\\^. whence, as SS(a,u)^ > 0, we must have A < 0. Thus, if A > 0, SSx is 
positive-definite, that is, an inner product. Conversely, suppose A < 0. Let = ^(x,x) < 
1/n^ and = SSxix,x), and recall that, since is an inner product. Lemma 1 (d) gives us 
r^ > (the inequality strict, as E is not one-dimensional.) Now we have = S§xix,x) = 
A(r^ - 1/n^) + 1/n^ < 1/n^. But now, again by Lemma 1 (d), S§x is not an inner product. □ 

It now follows that, at the critical value A = 1/(1 - cn^) for which the form S§x is orthogo- 
nalizing, ^x is positive — a SPIN form — iff A < 1/(1 - mn^), i.e, 

(l-mn^)>il-cn'^) o -mn^ <-cn^ o c<m 

which occurs iff c = m, i.e., iff is minimizing. Thus, we have 

Corollary 2: Let A be irreducible. Then E supports an orthogonalizing SPIN form iff it 
supports a minimizing SPIN form. In this case, the unique orthogonalizing SPIN form is 
the form 9^, where A = j^^, the maximum value of A for which S^x is positive. This is 
an inner product. 

We are free to replace the given SPIN inner product SB in Lemma 2 with any other SPIN 
inner product. Choosing the SPIN inner product SSj for the maximal value A, we obtain 
a range of values < A < 1. Henceforth, I assume this parametrization, so that A = 1. To 
emphasize that SSi is an inner product, I'll sometimes write it as 

It follows that, where A is irreducible, the inner product = < , )i is the only candidate 
for an orthogonalizing SPIN inner product. To put it another way: if A is irreducible, then 
there exists at most one orthogonalizing SPIN form on A, and this is an inner product. 



3 Composites and Conjugates 

Evidently, what is now wanted is a phyically (or operationally, or probabilistically) natu- 
ral condition guaranteeing the existence of an orthogonalizing (equivalently, minimizing) 
SPIN form. In this section I will offer three (not entirely independent) such conditions. 
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All turn on the notion of a composite system. Very briefly: there is a correspondence 
between non-signaling bipartite states and bilinear forms, so that equivariant bipartite 
states give rise to SPIN forms. The game is to seek conditions on such a state that (i) 
have a clear physical (or operational, or probabilistic) meaning, and (ii) guarantee that 
the corresponding SPIN state is orthogonalizing. I'll start with a quick review of how 
composite systems are handled in the current framework. More detail can be found in 
lll|29lj!l 

3.1 Composite systems and non-signaling states 

Definition 11 (Composites): A composite of two models A and 5 is a model AB, plus 
an injection X{A) x X{B) X(AB), which I'll write as (x,y) ^ xy, such that 

(i) for all E G 21(A) and F e 2l(fi), EF := {xy\x eE,yeF}e 2l(AB), 

(ii) for all a e n(A), p e QiB), there exists some j e QiAB) with yixy) = aix)l3iy); and 

(iii) for all g £ GiA),h e G(SX there exists some k e GiAB) with kixy) - igx)(hy) for 
every xe X(A), ye XiB)S 

Condition (i) of the definition allows us to identify X(A) x X(B) with the set XiA)XiB) = 
{xy\x e X,y eY} of product outcomes in XiAB). Let us write 21(A) x 21(B) for the set 
of product tests, i.e., tests of the form EF provided for by condition (i). Evidently, every 
state in T restricts to a state on 21 x ; by (ii), the set of such restrictions contains all product 
states a ® j6, defined by (a ® p)ixy) - aix)l3iy). Also, by (iii), the stabilizer in GiAB) of the 
set X(A)XiB) extends the action on the latter of G(A) x GiB). 

Another consequence of condition (i) is that 

xi ±X2 in XiA) =^ xiy J-X2y in XiAB) 

for every choice of y £ X{B); likewise, if 3*1 -L y2 in X{B), then xyi ± xy2 for every x £ X{A). 
This observation will be exploited below. 

Remark: The category of all probabilistic models and morphisms has a natural product 
structure. Given models A and B, let A x B be the model with outcome set X{A) x X{B), 
test space 21(A) x 21(B) = [E x F\E £ 2l(A),F £ 2l(S)}, state space the convex hull of the 

^Another, unrelated, motivation is sketched in 1301. By choosing a fixed pure state eg, we can represent 
elements of E as continuous random variables on G, via x e X ^ x eR^, where xig) - a{gx). That the 
canonical inner product obtained by group averaging be minimizing — which, in view of Corollary 2, is 
equivalent to the existence of an orthogonalizing form, at least for irreducible models, is equivalent to the 
condition that the covariance cov(x,y) of two of these random variables be minimized precisely when the 
corresponding outcomes are distinguishable. 

^''The notation AB is not to be understood as referring (yet) to any particular operation of composition. 
That is, AB does not refer — as yet, anyway — to any particular composite. 
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product states, and symmetry group G(A) x G{B), acting as usual. (This is not a cartesian 
structure, since there are in general no morphisms A x B ^ A to serve as projections.) 
If we strengthen condition (ii) in Definition 11 to require that there exist a group ho- 
momorphism y/ : G(A) x G{B) G{AB) with y/(g,h){xy) = (gx)(gy), then g,h ^ k is a 
homomorphism, and x,}"-^ xy defines a morphism AxB ^ AB. 

A state w on a composite systenm AB is non-signaling IflTil iff it has well-defined marginal 
(or reduced) states 

^ coixy) =: a)2iy) and a)ixy) =: coiix), 

xeE yeF 

independent of the choice of tests E e 21(A), F e 21(5). In this case, for every y £ X{B) and 
X e X{A), we define the conditional states (Di\y and id2\x on (X,2l) and {Y ,), respectively, 
by 

(Dixy) (Dixy) 
(Oi\y{x) := — — and a)2\xiy) := — —. 
(t)2iy) cjiix) 

It is straightforward to establish the following bipartite laws of total probability for a 
non- signaling state w: 

(^1 = ^2iy)toi\y and co2=Y. (^i(x)co2\x (2) 

yeF xeE 

for any choices of tests F e 21(B) and E e 21(A). 

Definition 12: A composite AB of models A and B is non-signaling iff all of its states are 
non-signaling, and all conditional states belong to the designated state spaces of A and 
B — that is, a)2\x e niB) and a)i\y e D.{A) for all x £ X(A) and 3' £ XiB). 

In particular, then, if AB is a non-signaling composite in the sense just defined, then 
(oi £ n(A) and 0)2 £ HiB) for every state co e D.iAB). 

It is not hard to show (see Il26ll ) that if co is non- signaling, then it gives rise to a unique 
bilinear form SScd on E(A) x E(B) with SSa)ix,y) - a)ix,y) for all outcomes x £ XiA),y £ 
XiB). Thus, every non-signaling state w on AB is associated with a positive linear 
mapping E(A) - E(B)*, given by S(a)(6) = SSMib) for all a £ E(A),6 £ E(B). Since 
the conditional states (i)2\x and ii)i\y lie in V(A) and \{B), respectively, the range of this 
mapping is contained in V{B), so we can — and I shall — regard oi as a positive linear 
mapping 

& : E(A) - Y{B). 

If (,) is a self-dualizing inner product on E(B), we can re-interpret S as positive linear 
mapping S : E(A) E(B), given by the condition 

{w{a),b) =Bcjia,b). 
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Conversely, if w is given by a biliner form SSai, it must be non-signaling, since we then have 
for all £ £ 21, and similarly in the second argument. 
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Notice that S*(&)(a) = S(a)(6) = ci)ia,b). Notice, too, that Qiu)iy) - T-xeE^^i^^yh i-e., loiu) 
is the marginal of a> in r2(X(fi),2l(fi)), whence, S(x)/a(S(x)) is just the conditional state 
cl)2\x- Accordingly, S is called the conditioning map associated with w Where this map is 
an order-isomorphism E(A) V(B), we say that co is an isomorphism state [Tjl. 

Jargon: Let a> be a non-signaling state on AA, and let SSo) be the corresponding bilinear 
form. If is a SPIN form, I'll call w a SPIN state. 

A trivial but important example of a non-signaling state is the uniform (or maximally 
mixed) state on AA: p(z) = 1/n^ for every z e XiAA). The associated SPIN form, with 
parameters c = r^ = 1/n^, is exactly the degenerate, or uniform, SPIN form 

For later reference: 

Definition 13 (Local Tomography): A composite AB is locally tomographic iff bipartite 
states in Q(A5) are uniquely determined by their values on product outcomes — that is, 
iff for all a>i,a>2 G 0(AB), 

(i>i(x,y) = (D2(x,y)\/x e X(A),y eXiB) ^ 0)1 = 0)2. 

Remark: If AB is non-signaling, then (in our current, finite-dimensional setting), local 
tomography sets up a linear (NB: not ordered-linear) isomorphism E(A5) ^ E(A) 18 E(S). 
The cone on E(A)®E(5) obtained by carrying forward the cone E+(Afi) sits between 
the minimal (or projective) cone generated by the product states, and the maximal (or 
injective) cone consisting of all positive bilinear forms on E*(A)®E*(S) 14. .26.1 . 

Both non-signaling and local tomography conditions are routinely assumed (sometimes 
explicitly, sometimes tacitly) in recent discussions of composite systems in generalized 
probabilistic theories ( 11151 [51 [251 lllll . etc.). The non-signaling condition will be important 
in what follows, but the extremely powerful local tomogrcaphy assumption plays no role 
here at all (but see further comments in the Conclusion). 

3.2 Conjugate Systems 

In view of the fact that equivariant non-signaling states yield SPIN forms, it is temping 
simply to postulate the existence of a state a> on a composite AA with the property that 
o){xy) = for all x -L 3^ in X(A). Such a state would perfectly correlate every test £J e 21 
with itself, in that, where Alice and Bob perform the same test at their locations, they 
are guaranteed the same outcome. 

Unfortunately, in ordinary quantum theory, there is no such state: the candidate is the 
normalized trace, i.e., B{x,y) = Tr{PxPy) = \ {x,y)f', which corresponds to no bipartite den- 
sity operator. Fortunately, though, the strategy does work with a small modification. 
Consider a complex Hilbert space H and its conjugate space H, and let ^ g H ® H be the 
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(twisted?) Bell state 

xeE 

where E is any orthonormal basis. This is independent of the chosen basis, and perfectly 
correlates every observable with its conjugate analogue — indeed, <^,xiE>y) = ix,y), so 
that\C¥,x^y)f = \{x,y)f. 

This suggests the following idea. Recall from Section 1 that an isomorphism from a model 
A to a model B consists of a bijection : X(A) —>■ X{B) taking 21(A) bijectively onto 2l(-B), 
and such that )6 )8o0 is an affine isomorphism from Q.{B) to f2(A), plus an action of G(A) 
on B by elements of G(B), such that isomorphism : G(A) G{B) such that (p(^gx) = g<p{x) 
forallxeX(A),^eG(A). 

Definition 14 (Conjugate Models): Aconjugate for a model A is a structure (A,7a,?7a), 
where A is a model, 7a : A ^ A is an isomorphism, and t]a is a bipartite state (on some 
non-signaling composite) AA such that 

t]a(x,ya(x)) - lln 

for every x e X{A). I'll call ja the conjugation map and via, the correlator for the given 
conjugate. 

Example 7: Quantum Cases If A = A(H) is a quantum model associated with a complex 
Hilbert space H, let A = A(H); let 7a : X(JS.) be the mapping x^x (strictly speak- 

ing, the identity map!), and let riA{x,YAiy)) = \(^ ,x ^ y)f = TriP^>Px^y). As discussed 
above, this last is a correlator — obviously, symmetric and invariant. 

Lemma 5: If A has a conjugate, then it has a conjugate for which the correlator rjA 
is symmetric, in the sense that 'qiXjjAiy)) = V^y^TAix)), and invariant, in the sense that 
'nAigx,gy) = r]{x,y). 

Proof: Let T]'^(x,YA(y)) '•= iJ^y^TAix)). Observe that this is again a correlator. Averag- 
ing the two gives us a symmetric correlator. Now suppose t] is symmetric, and consider 
r;^(x,y) = r]igx,gy). This again is a symmetric correlator, so averaging over the group 
jdelds an invariant symmetric correlator. □ 

Convention: Henceforth, assume that correlators are symmetric and invariant. It follows 
that 

mia,b):=T]ia,rAib)) 

is an orthogonalizing SPIN form — and hence, if A is irreducible, an orthogonalizing 
SPIN inner product — on E(A). 

Theorem 1: Let A be bi-symmetric, and have a conjugate (A,ja,t]a)- Then the following 
are equivalent: 

(a) A is state-complete and t] is an isomorphism-state. 
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(b) A is self-dual with respect to the form S§{a,b) :- 7/^(0,7(6)). 

If A is irreducible, then B is an inner product, and (a) and (b) are equivalent to 

(c) A is state-complete and sharp. 

Proof: (a) => (b): If t] is an order-isomorphism, 77* takes E+'s extremal rays to those of 
V+. Since A is state-complete, the latter is Ei^. In particular, Tj*(x) - ?72(^)?7i|x is pure, 
and every pure state on E(A) looks like this. We therefore have transitivity of G on pure 
states of A, and also that f]*(x)iy) = SS{y,x) = SS{x,y), so that fyii^ = «5§(x, •) corresponds 
to a point in E+. Thus, E+ c E+. 

(b) => (a) Conversely, suppose A is self-dual with respect to SS. Let T = 77*oy:E^V = 
E* (the latter identity, one of linear spaces, not yet of ordered linear spaces). We have 
t(E+) c V+ ^ E* , so this is a positive mapping. Since E is self-dual with respect to S^, 
we have ker(T) < E"*" = E+; since the latter cone contains no subspaces other than 0, t is 
injective, and thus, in the present finite-dimensional setting, a linear isomorphism. The 
definition of t gives us SB{a,b) = T{b){a) for all a,b e E(A). Thus, for /3 = t(&), 6 g E, we 
have 

;6eE; o T(6)(a)>0 Va£E+ ^6gE+ 

(the last, by self-duality), whence, t~^(E* ) c E+. In other words, t is an order-isomorphism. 
Since 7 is also such, it follows that r]* is an order-isomorphism, i.e., rj^ is an isomorphism 
state. Moreover, we have 

e; = t(e+) = t(£;+)cv+, 

whence, 'K*^ = V+, i.e., A is state-complete. 

Suppose now that A is irreducible. Corollary 2 then tells us that the orthogonalizing 
SPIN form is an inner product. It follows from Lemma 2 that (c) ^ (a). 

(b) (c): We saw above that (b) implies state-completeness. Since A is irreducible, the 
orthogonalizing SPIN form is an inner product (indeed, = SSi, in the notation of 
Section 2.3). We have 6x '■= n{x\ e V", so that nx eE'^ = E+, with {nx,x} = 1. Since x 
is extremal in E+, 6x is extremal in V+ = E^. By state-transitivity, every pure state 
has the form 6y = n{y\ for some y ^ X. In particular, then, for every vector i; e E"*" with 
{v,x) = {v,u) = 1, we have ||i;|| = ||nx||. It follows that if {v,x) = {x,x) = 1, v = x. Thus, A is 
sharp. □ 

Where a correlator 77 is an isomorphism state, I'll call it an iso-correlator. Using this 
jargon, we have 

Corollary 3: Let A be state-complete, bi-symmetric, irreducible, and have a conjugate 
with an iso-correlator. Then A is self-dual. 

In II29II . I called a bipartite state o) £ AB on two rank-7i test spaces correlating iff, for 
some pair of tests E e 21(A) and F e 2l(S), there exists a bijection f : E ^ F such that 
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(i)ix,f{y)) = for X 7^ J*. Evidently, the correlator of a conjugation is correlating in this 
sense (choose E e 21(A) and F = jAiE) e 21(A), and let fix) = ja(x) for x £ E). The corre- 
lation condition of 11291 1301 requires that every state on A arise as the marginal of some 
correlating bipartite state on a composite of two copies of A. 

A stronger condition than the existence of a conjugate system, which will turn out to be 
useful, is the following 

Definition 15 (Strong Conjugates): A strong conjugate for a model A consists of a 
system A, an isomorphism jA '• - A, and a composite AA, such that for every state 
a G n(A), there exists a non-signaling state £ QiAA) satisfying 

(a) co" = a (that is, w" is a dilation of a) 

(b) a)"(gx,gy) - coix,y) for all g fixing a, and 

(c) 0)" is correlating along ja, in the sense that there exists at least one test £J e 21 
with a){x,x) = a(x) for all xeE (where, as above, x = ja(x)). 

Notice that a strong conjugate is (in effect) a conjugate, since we can take t] to be ojP, 
where p is the uniform state on A . 

Example 8: The quantum case. That the conjugate, A(H), of a quantum model A(H), 
is in fact a strong conjugate is essentially just the Schmidt decomposition. Let H be 
a Hilbert space, and, as above, let H denote the conjugate Hilbert space. For x,y eH, 
let xQy denote the operator on H given by (xQy)z = {z,y)x. In particular, if x is a unit 
vector, then X0X = Px, the orthogonal projection operator associated with x. The mapping 
x,y ^ xQ y is sesquilinear, that is, linear in its first, and conjugate linear in its second, 
argument. Hence, there is a natural linear isomorphism H ® H ^ SS(H.) taking x iS) y to 
xQ y. Suppose now that W is a density operator on H, diagonalized by an orthonormal 
basis E e 21(H). Then W has spectral resolution 

W = ^ XxPx = ^ ^xX X. 
xeE xeE 

The corresponding vector in H ® H is then 

'■= ^ A.xXQx. 

xeE 

Ifu,veE with u J-V, then for every xeE, we have either (x, u) = or {u,y) = 0, whence, 

C'yw,u^U) = Ax{x,u}{x,v) = 0. 

xeE 

Moreover, on the diagaonl, we have 

{Ww,u <S)u} = Ax\(u,x}\^ = {Wu,u}. 

xeE 
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Evidently, the pure state corresponding to W sets up a perfect correlation between E and 
its corresponding test E, along the canonical isomorphism x^x. Of equal note, if ^ is a 
unitary leaving W fixed, i.e, with gWg~^ = W, then the bipartite state (corresponding to) 
^ is also invariant under the diagonal action of G = U(H.): 



(where, for an operator a on H, a denotes the linear operator on H given by a{v) = av for 
all V e H.) 

3.3 Factorizable States 

Another way to motivate the existence of an orthogonalizing SPIN form on an irreducible 
system A is to suppose there exists an irreducible system B (perhaps another copy of A) 
and a non-trivial SPIN form S§ on a composite AB that factors, in the following sense: 

Lemma 6: Let AB be a bi-symmetric composite of bi-symmetric models A and B. Let SS 
be a SPIN form on E{AB), and suppose that factors, in the sense that, for all x,x' e 
XiA) and all y,y' e X{B), we have S§ixy,x'y') = SS\{x,x')SS2iy,y') where and are 
normalized bilinear forms on E(A) and E(B), respectively. Then and are SPIN 
forms. If A and B are irreducible, then either (i) is uniform, or (ii) SS\ and are 
orthogonalizing. 

Proof: That S^i and S^2 are both positive and symmetric is clear. To see that is 
invariant, note that 

S§i{a,b) = SSi(a,b)SS2(uB,UB) = S§(auA,buB). 
Thus if ^ e G, we have 

SSi{ga,gb) = SSi(ga,gb)SS2iuB,UB) = SS(gauB,gbuB) 

= mig,h)iauB)ig,h)ib,UB)) 
= ^(a,UB,b,UB) - SSiia,b) 

where heH is arbitrary; similarly for Now let c, r be the parameters associated with 
SS, and let ci,ri and 02,^2 be the parameters associated with ^1 and SS2, respectively. 
Let X J- y in XiA) and z L w e X{B). As observed abovJ^. it follows that xz -L yz and 
xz -L xw, so we have 

c = S^{xz,yz) = SBiix,y)SS2iz,z) = cirg and also c = SSixz,yw) - S^i{x,y)SS2{z,w) - ciC2- 
^^See the remarks following Definition 11 



(0igU,gv) 



Xx{x®x,gu®gv) 



K{x,gu){x,gv) 



Y ^xig ^X,u){g ^X,V) 
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Thus, cir^ = C1C2. If Bi is not orthogonalizing, then = C2- Since ^ is irreducible, S§2 is 
uniform. But now the same reaoning, with the roles of S§i and SS2 reversed, tells us that 
C1C2 = r^C2, whence, as C2 ^ 0, means that ci = r^, whence, ^1 is also uniform. But then 
SS — hence, co — is uniform as well. □ 

Definition 16: Let A and B have conjugates A and 5. A state a> on {AB)AB) is factorable 
iff iff there exist states a)A on AA' and o)b on BB' such that loixyxy) = coA(xx)a)Biyy). 

Applying Lemma 6 to the bilinear forms associated with the non-signaling states cj,a)A 
and a)B, we have 

Theorem 2: Let A and B be irreducible, and let co be a factorable equivariant state on 
iAB)(AB). Then either co is the uniform state, or wa and cob are orthogonalizing. 

Thus, if A is irreducible, B is a copy of A, and we can find a SPIN state on iAB)iAB) 
making AA and BB independent, we are guaranteed an orthogonalizing SPIN form. 



4 Monoidal Probabilistic Theories 

In the categorical approach to quantum foundations [H [21 124II . it is usually assumed — 
naturally enough — that a physical theory is a symmetric monoidal category 'I0, in which 
objects represent physical systems, morphisms represent physical processes, and the ten- 
sor product represents the physical composition of systems. A stronger, and perhaps more 
mysterious, assumption is that be dagger- monoidal, i.e, that it carry an involution 
compatible with the monoidal structure. In this section, I consider a symmetric monoidal 
category of bi-symmetric probabilistic models, and consider the associated "linearized" 
category E(^) consisting of the linear hulls of the models in 'io. The main result is that, 
if E(^) is consistent with the existence of reasonable dagger-monoidal structure (the ad- 
jective "reasonable" being spelled out in Definitions 19 and 20 below), then there exists a 
factorable SPIN form on E(A) for each model Ae'io; hence, irreducible models in 'io carry 
orthogonalizing SPIN inner products. Add the requirement that models in ^ are sharp 
and state-complete, and all models in are self-dual. 

4.1 Monoidal categories of probabilistic models 

Henceforth, ^ will denote a category of models, with morphisms as defined in section 
1.4. It is reasonable to require — and I shall require — a bit more, namely, that every 
symmetry g £ Ga is in fact a morphism in ^, i.e., Ga ^ ^(A, A). 

A symmetric monoidal structure on a category "tf is a bifunctor ® : "tf x ^ ^ <tf , plus a 
designated unit object I e^, and natural isomorphisms 

«A,fi,c : A®(B«)C) - (A«)B)® C, aA,B :A^B ^B® A, 

Xa:I ^A^A, and pA-A<8)I^A 
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for all A,B,C e^. These isomorphisms are also required to satisfy various coherence 
conditions, e.g., that Xa °crA,i = PA- See IfTOH for details. A symmetric monoidal category 
is a category equipped with such a structure. A dagger on a SMC is an endo-functor 
'\ \ such that, for all objects A g 'i^, = A, cr^ ^ = (Tb,a, and, for all morphisms 

(P,y/ in = (p and {(p ® y/)^ = (p^ ® y/^. A dagger-monoidal category is a SMC equipped 

with a dagger. 

Definition 18 (Monoidal probabilistic Theories): A monoidal probabilistic theory is 
a symmetric monoidal category of probabilistic models, such that 

(i) G(A) < '^(A, A) for every A e ^, 

(ii) for every A,B e^, the monoidal product A^B defines a non-signaling composite of 
A and B, in the sense of Definition 11 

(iii) the morphism AxB ^ A^B sending x £ X{A),y e X{B) to xy e X{A ®B), is a mor- 
phism in 

For each model A e we have the corresponding linear hull, the order-unit space E(A). 
Now, every morphism G '^{A,B) defines an affine mapping 0* : 0(S) 0.{A), given 
by 0*()6)(x) = fi{(p{x)). Pulling back again, we have a linear mapping (p** : Aff(n(A)) 
Aff(Q(fi)), which evidently takes E(A) to E(S). Thus, A E(A) is the object part of a 
covariant functor from 'lo to the category of ordered linear spaces and positive linear 
mappings, taking (p e '^iA,B) to the coresponding positive linear mapping (p : E(A) 
E(S) where (pia)il5) = aip o (p) for all a e E(A) and p g QiB). 

It is easy to check that the unit object for a monoidal probabilistic theory will necessarily 
be the trivial model T - ({1}, {{1}}, {1}, {e}) having one outcome, one test, one state, and one 
symmetry. It is an annoying fact that there are no morphisms between T and any non- 
trivial model. The linearized category EC^) will inherit the same defect. Thus, we'd like 
to extend the set of morphisms in the latter - at a minimum, we'd like to allow arbitrary 
linear mappings E(T) =^ [R ^ E(A) — representing elements of E(A) — as well as some 
linear mappings E(A) U, representing states, to count as morphisms. This suggests 
the following 

Definition 19 (Representations): A representation of a probabilistic theory "i^ is a func- 
tor n:'^ ^ S where (f is a category of order-unit spaces, such that 

(i) for all y, W G ^, Siy is a space of linear mappings, ordered by a cone ^+(y, W) 
of positive linear mappings; 

(ii) ii{A) = E(A) for every A G S^, and n{(p) - (p for every G '^(A,B), 

(iii) (?(K,E(A)) ^ E(A), for all A^S. 
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A representation is self-dual, resp. HSD, iff n{A) is self-dual, respectively HSD, for every 
model A G 

If ^ is a monoidal probabilistic theory, we can ask that S also be symmetric-monoidal, 
and that tt be a monoidal functor, i.e., n{AB) = niA) 18 niB) (at least up to a canonical 
isomorphism). In this case, we shall say that tt is a monoidal representation of^. 

4.1 Dagger-Monoidal Representations 

A basic assumption in the categorical approach to finite-dimensional quantum theory 
III mil?! is that the category of physical systems and processes should be, not just a sym- 
metric monoidal, but a da^^er-symmetric monoidal category. Roughly, the monoidal prod- 
uct A,B ^ A^iB is understood to capture the idea of a composite of two non-interacting 
(but possibly entangled) systems; the meaning of the dagger is a perhaps a bit more 
mysterious, but is suggestive of an operation of time-reversal. 

Definition 20 (Dagger-monoidal representations): A dagger-monoidal representa- 
tion of a monoidal category of probabilistic models is a monoidal representation n : 
^8 where 

(i) S is t-monoidal, with 7 =^ [R, 

(ii) i/^ o = 1 for all A G and 

(iii) n{g-^) = n{gi^ for all g e G(A), A g S?. 

I'll say that 'io is dagger-monoidal iff it has a dagger-monoidal representation. 

Subject to assumptions (i) and (ii), there exists, for each A g a canonical G(A)-invariant, 
positive, symmetric bilinear form on E(A) given by 

{a,b}:^aob* (3) 

with a, be E(A) ^ ^(/,E(A))@ Now, just by virtue of the monoidality of S, this bilinear 
form factors, in the sense that 

{a®b,c^d) = {a,c){b,d). 

This at once yields 

Theorem 3: Suppose ^ is 2-symmetric, and admits a dagger-monoidal representation. 
Then the canonical form (3) is orthogonalizing on every irreducible system Ae"^. 

The proof is virtually identical to that of Theorem 2. 

Combining this with Lemma 2 and Corollary 2, we have 
^^In general, this is not an inner product; the angle-bracket notation is, however, standard in this context. 
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Corollary 4: Let be as in Theorem 3, and let Ae'^ be state-complete, sharp and irre- 
ducible. Then the canonical bilinear form (3) is an inner product, with respect to which 
E(A) is self-dual. 

I'll call "t-SelfDual (t-SD) iff there exists a t-monoidal representation n:^ —^S where 
each n(A) is self-dual with respect to (3) — meaning, in particular, that this blinear form 
is an inner product for every A e 'tf . Corollary 2 tells us that if 'tf has a dagger-monoidal 
representation and every A £ 'i^ is state-closed, sharp, and irreducible, then is dagger- 
SD. 

Remark: Suppose that every object A e has a conjugate (.A,t]a,Ta) with A = A, in the 
sense that A = iA,T]A ° (^,7^) (here a:AxA^AxA is the obvious swap mapping). If 
the correlators tja are symmetric, in the sense that riA{a,b) = rjAib,A) for all a,b eE(A), 
then one can construct a dagger on E(A) as follows: if £ '^{A,B), set 

where ta : E(A) — E(A)* is the mapping given by TAio) - r]*{YAio,)), i.e., TA(a)(6) = 
^7A(&,TA(a))- With this definition of (p^ , one has <(/>(a),6) = {a,4>\b)) where (a, 6) := 
^7A(a,TA(6)), as above. (It would be interesting to know how the existence of a canoni- 
cal, involutive operation A A of conjugation on a monoidal probabilistic theory comes 
to making the category dagger-compact.) 



5 Image-closure 

Thus far, our results mainly concern irreducible systems. One way of extending them to 
possibly reducible systems involves a condition I'll call image-closure. 

Definition 21 (Image of a model): A morphism {(p,y/):A^B is surjective iff <piX) = Y, 
Q (pirn, H = y/{G), and D(B) = {/3 £ n(X(5),2l(fi))|0*(j6) £ H}. In this case, we call B the 
image of A under {(p,y/), writing B = 0(A). 

Notice that the image B = 0(A) can be simulated by A, as follows. To prepare B in state 
P, prepare the A in the state (p*{fi). To measure F = (p{E), measure E on A, and, upon ob- 
taining outcome x£E, record 0(x) as the outcome of F. To implement a symmetry heH, 
implement any corresponding symmetry g e y/~^(h) Q G. Operationally, it is reasonable 
(so long as we can prepare arbitrary states) to take 0(A) as a legitimate physical model 
whenever A is. 

Definition 22 (Image-closure): Call image-closed iff, for any model Ae^ and any 
surjective morphism = {(p,if/):A^B, (i) the model B belongs to S^, and (ii) O £ '^iA,B). 

Remark: The image of a 2-symmetric model is 2-symmetric. 
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It would be rather embarrasing, at this point, if the category '^qm of quantum models 
were not image-closed. In fact, however, a quantum model has no non-trivial images at 
all. 

Definition 23 (Incompressible Models): A model A is incompressible iff, for all models 
B, any surjective homomorphism : A ^ S is either an isomorphism, or is trivial in the 
sense that Xb is a single point. 



Lemma 7: Every quantum model is incompressible. 



Proof: see this, notice first that if (p : A ^ B is a surjective morphism of models, with G 
acting transitively on X(A), then X(B) is a transitive G-set. Hence, X{B) ^ G{A)/G(A)y, 
where G(A)y is the stabilizer of any y e X{B) under the action of G(A) on X(B). Since 
(p is equivariant, G{A)x < G{A)y for any x g X{A) with (pix) = y. Now suppose that A is 
quantum, i.e., A = A(H) for a complex re-dimensional Hilbert space H. Then X(A) = X(H), 
the set of rank-one projections on H, and G(A) = C7(H) ^ U(n), with G(A)^ ^ U(n - 1). 
Now, U{n - 1) is a maximal proper subgroup of t7(?i0 Hence, as G(A)x < G(A)-y, we 
have either GCA)^, = G(A)^ or G(A)y = G(A). In the former case, is a bijection, in the 
latter, X(B) is a point. Thus, a non-trivial surjective image of a quantum model A(H) = 
(Z(H), 21(H), n(H), Uin)) has the form (X(H), 21(H), T, Uin)), where T is the set of all states 
on (X(H), 21(H)) of the form po(p where p is (the state associated with) a density operator 
on H and (p is a symmetry of iX(H.), 21(H)). By Wigner's Theorem, (p has the form (pix) = 
V~^xV where y : H ^ H is either unitary or anti-unitary. We have, for every unit vector 
xeX(H), 

(p o 0)(x) = Tr(p0(x)) = TripV^xV) = Tr(ypy-^x) 

That is, the state p o is the state on (X(H), 21(H)) associated with the linear operator 
yopoy~^ — which is a linear, and thus a density operator, regardless of whether V is 



linear or anti-unitary. Thus, T = n(H). o D 



Remark: This argument shows that any model A such that (i) X(A) transitive under 
G(A), (ii) the stabilizer G(A)x of an outcome x £ XiA) is a maximal subgroup of G(A), and 
(iii) Q(A) invariant under symmetries of (X (A), 21(A)), is incompressible. 



Our goal now is to prove the following: 



Theorem 4: Suppose is an image-closed category of bi-symmetric models, having a 
dagger-monoidal representation. Then E(A) carries an orthogonalizing SPIN inner prod- 
uct for A^^. 

Let (X,2l,r2,G) be a fully symmetric model. Let (,) be a SPIN inner product on E, e.g., 
the one arising from group averaging. Suppose M < u"*- is a G-invariant subspace of U"*-. 
Let P : E ^ M be the corresponding projection operator (defined w.r.t. the standard inner 

^^Thanks to David Feldman for pointing this out 

^^If dini(H) > 2, Gleason's Theorem gives us p o e Q even more trivially. The preceding argument also 
works if dim(H) = 2. 
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product). For every x e X, set 

xi :=Pix) + u/n 

Then Y.xeEXi = P(u) + u = u (with P(u) = since M < u^). Let Xi = {xi|x e X}; for each 
Eem, set El = {xi\x e E}, and let 2li = {Ei\E £ 21}. Then is a fully symmetric 

G-test space. Since X spans E, Xi spans Ei := M® (u>. Let Ei+ denote the cone in Ei 
consisting of non-negative linear combinations of elements of Xi. 

Lemma 8: There exists a G-invariant, separating set fli of states on (Xi,2li) such that 
(i) Ai := (Xi,2ti,Qi,G) is a bi-symmetric model (in particular, G acts transitively on the 
extreme points of Qi), and (ii) the pair ((p,idQ), with (p : X —>■ Xi given by (pix) = xi, is a 
morphism of models. 

Proof: Let i; e E"*" represent a pure state, i.e., an extreme point of D. Set vi = P{v) + u, and 
note that 

{vi, u) - {P{u), u) + {u, u) = l 

and, for all x e X, 

<i;i,xi) = {P(v) + u,P(x) + u/n) = {P(v),P(x)) + lln = {vi,x). 

There is no gaurantee that this last will be positive for all x e X; however, we can choose 
e > so that 

Ve :=ci;i + {l-e)u 

belongs to E^ - and hence, to E| — since u lies in the interior of E"*". Now let 

f2i :=cdiGve) 

This is clearly a closed, convex, G-invariant set of states on (Xi,2li). We must show it is 
separating. Suppose 

{gVe,Xi) = {gVe,yp) (4) 

for all ^ e G. We have 

igVe,xi) = {evi + il-e)u,g~^xi) 

= {evi + {l-e)u,g~^Px + u/n} 

= € [{vi,g~^Px) + {vi,u/n) + (!-£:) [{u,g~^Px) + {u, u/n) 
= e{vi,g~^Px) +el/n + {l-e)l/n 
= e{v\,g~^Px) + lln. 

Similarly, {Ve,yi) - e{vi,g^^Py) + l/n. Thus, (4) implies {v-i_,g~^Px) - {vi,g~^Px), whence, 
{Piv)+u,g-^Pix)) = {Piv)+u,g-^P(y)), whence, {Piv),g-^P(x)) = {P(v),g-^P(x)). But this 
last is 

{gv,Px) = {gv,Py) 

for all £ G. Since {gv\g e G} is the full set of extreme points of H, it is separating for E. 
It follows that P(x) = P{y), i.e, x\ - y\. □ 
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If M is a minimal proper G -invariant subspace ofu^, the model Ai is irreducible. Hence, 
if Ai supports an orthogonalizing positive symmetric invariant bilinear form, then (by 
Corollary 2) this form is the standard SPIN inner product < , )i on Ei. 

Remark: The foregoing proof shows that if A is incompressible, then is irreducible in 
E(A). Thus, we can dispense with image-closure, if we are willing to focuss our attention 
on incompressible models: 

Theorem 4b: Let be a dagger-monoidal category of 2-symmetric probabilistic models. 
Then for every incompressible model A e S^, E(A) hosts an orthogonalizing SPIN inner 
product. If A is also state-closed and sharp, then E(A)+ is self-dual. 

Returning now to a general situation, let u^ = Mi ® • • • ® with each an irreducible 
invariant subspace for G. Let Pi,....,Pk be the corresponding projections, and, for each 
xeX, let Xj = Pjix) -\- uln, j = 1, n. Lemma 8 gives us, for each j, a bi-sjrmmetric model 
iXj,^j,Q,j,G), and, with this, a space E^ = ® (u) (ordered by the cone spanned by Xj). 
Finally, since each Aj is irreducible, Corlollary 2 gives us a standard (maximal) SPIN 
inner product (, )j on Ey 

Lemma 9: If {,)j is orthogonalizing for each j, then there exists an orthogonalizing inner 
product on E. 

Proof: With notation as above, let 

k 

{a,b), = Y,{Pjia),Pjmj + k{a,u){b,u). 
j=i 

This is clearly bilinear, invariant and sjrmmetric. Indeed, since each <,)^- is an inner 
product, so is (,)*. To see that it is positive on E+, note that for every x e X c E, x = 
iJ^jPix) + uln, so, for x,y €.X, we have 

{x,y)* = Y.{xj,yj)-^k/n^ 

j 

= Z ^PA^^ + u/n,Pj(y) -I- u/n}j 
j 

= L<xj,yj)j^o. 
j 

Since X spans E+(A), < , >* is positive. The same computation shows that if x -L y, so that 
Xj -L yj for each j, then, as {xj,yj)j = by hjrpothesis, <x,y)* = 0. □ 

Proof of Theorem 4: Let A = (X,2l,£2,G) be a model in ^, and proceed as above to con- 
struct models Aj - {Xj,^j,D.j,Gj) corresponding to the irreducible components of in 
E = E(A). By Lemma 8, Aj is the image of A under a surjective homomorphism. Since 
is image-closed, each Aj lies in S^. Since is also dagger-monoidal, we have a canonical 
invariant bilinear form (4) on each E(A), Ae^ , and this is orthogonalizing. Since Aj is 
irreducible, Corllary 2 tells us that this canonical form on E(Ay) = Ey must coincide with 
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standard form {,)j for all j = l,...,n, whence, the SPIN inner product < , )* of Lemma 9 is 
orthogonalizing. Theorem 4 now follows from □ 

This gives us 

Corollary 5: If ^ is an image-closed monoidal category of 2-symmetric models, admit- 
ting a '\-monoidal representation, then every state-closed, sharp model Ae'io is self-dual. 

6. Homogeneity 

Let "tf be an image-closed category of 2-symmetric probabilistic models. We've seen that 
if every model in 'lo has a conjugate, or if 'lo has a t-monoidal representation, then for 
every state-closed, sharp model A e"^, the cone E(A)+ is self-dual. If this cone is also 
homogeneous, then the Koecher-Vinberg Theorem tells us that E(A)+ is isomorphic to 
the cone of squares of a formally real Jordan algebra. 

There are several ways in which to motivate the homogeneity of E+(A), earlier explored 
in liSOU and 111. Before discussing these, let me mention one very direct interpretations 
of homogeneity. If we allow that all order-automorphisms (f> of E"*" ^ V(0) with u{(pia)) < 
uia) represent legitimate physical processes, then homogeneity simply requires that it 
be possible to prepare any state in the interior of the cone, with non-zero probability, by 
applying a reversible physical process to the maximally mixed state. The main objection 
to simply taking this as a postulate is probably just that the use of the adjective "interior" 
here seems unaesthetic. (Then again, we seldom scruple to accord special axiomatic 
privileges to pure states.) 

6.1 Self-Steering and Iso-Dilation 

In it is shown that homogeneity of the state cone follows from the assumption that 
every Ae'^ is "self-steering": 

Definition 24 (Self-Steering)): A system A has the Self-Steering property iff every state 
a G 0(A) arises as the marginal of some bipartite state co £ D,iA(^A) that is steering, in the 
sense that, for every convex decomposition tiUi = a of a as the average of an ensemble 
of other states, there exists an observable S = {aj on E(A) with w(ai, •) - ttai for each i. 

A less vivid, but mathematically simpler, assumption, also discussed in [7], is that every 
state in the interior of the state space, arise as the marginal of — or, in other language, 
can be dilated to — a bipartite isomorphism state, that is, a state co whose conditioning 
map, S, is an order-isomorphism E* =^ E. We might call this the Iso-Dilation condition. 
To see that this implies homogeneity, simply note that if a and ^ are any two interior 
states (not necessarily normalized), then by assumption there exist bipartite states wi 
and 0)2 with Si(j/) = a and S2(w) = /3, whence, (S2 = /3- Of course, there is still 

the (dubious?) aesthetic objection regarding the interior states. 
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That Self-Steering implies the homogeneity of the state cone is a consequence of the fact 
that any steering state on A 18 A having a marginal lying in the interior of the state cone, 
must be an isomorphism state. A priori, then, Iso-Dilation is the weaker condition. When 
V{A)+ is irreducible, isomorphism states are pure, so this is a relative of the "purification 
postulate" of [ 111 . 

6.2 Full symmetry, correlation and filtering 

Suppose A e 't^ is sharp and fully symmetric, rather than only 2-symmetric. Then we can 
use the "correlation" and "filtering" axioms from [30] to secure the homogeneity of E(A)+. 
Recall that a bipartite state <y on a composite AB correlates tests E g 21(A), F g iff 
there is a bijection f : E ^ F such that for all x,^ g x F with y ^ fix), CLi{x,y) = 0. In 
other words, on E xF, co is supported on the graph of f. In this situation, I'll say that o) 
correlates E and F along the bijection f. 

Definition 25 (Correlation Condition): A model A satisfies the correlation condition 
iff for every state a on A, there exists a model B, a composite system AB, and a correlat- 
ing bipartite state co on AB such that a>i = a. 

The Correlation condition (a dilation principle, like Steering and Iso-dilation) is by no 
means obvious on purely operational grounds. On the other hand, something like it is 
needed if we are to be able to capture measurement processes "internally", that is, in 
terms of the resources available in S^. For a further discussion of this point, see II29II . 

As noted in II291I30II . the correlation condition implies a kind of spectral decomposition for 
states: 

Lemma 10: Let A be sharp and satisfy correlation. Then for every state a on E, there 
exists a test E e 21(A) and convex coefficients tx with a = T-xeE tx5x- 

Proof: Let a = a»i where a* correlates E with F along f . Then, by (4), 

a = (^2iy)(iyi\x Y. ^2if(x))5x. 

yeF xeE 

Setco2(f{x)) = tx. □ 

Definition 26 (Filtering Condition): A satisfies the filtering condition iff for every test 
JS G 21, and every set of constants < - 1; there exists an affine automorphism O g E+ 
with $(x) = txX. I'll call such an automorphism a filter on E. 

Filtering is a reasonable assumption. If we think of a test E as, e.g., an array of detectors, 
then the axiom simply asserts that we can independently attenuate the reliabilities of 
these detectors — which, in practice, we can certainly do. 

Now suppose that A is sharp, and let 6x denote the unique normalized state on E with 
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(5x(x) = 1. If $ is a filter onEe 21(A) with $(x) = t^x, > 0, then 

and similarly, (t>*i6x)iy) = for 3/ Ix. It follows that t~^(^*{dx) = 6^, i.e., ^*i5x) = tx6x- 
As observed in II30II . we now have 

Lemma 11: Let A be sharp, state-complete, fully G-symmetric, and satisfy both the corre- 
lation and filtering axioms. Then V(A)+ is homogeneous. 

Proof: Let a and /3 be normalized states in the interior of E+. We wish to find some order- 
automorphism of E taking a to p. By Lemma 10, we can expand a and ^6 as a = ZxeS tx^x 
and 15 = T-yeF^y^y for some tests E,F e 21. Since a and )6 are interior, > and Sy>0 for 
all X G and y ^F. Let f :E ^ F he any bijection, and let O be a filter on E taking each 
xeE to where /rix = Sf(x)ltx- Then we have 

xeE xeE xeE 

By full symmetry, / extends to a symmetry g €G; applying this, we have 
^0*(a) = Y Sf{x)gSx = E SfxSfix) = ^y^y = j6. □ 

xeE xeE yeF 

Recall that if (A, 7^) is a strong conjugate for A, then for every state a eD., there exists 
an equivariant state on AA with coi = a, and correlating some test E £ 21(A) with the 
conjugate test E e 21(A) along x^x:- ya(x). This gives us the correlation property, and 
also, if A is sharp and state-complete, self-duality (by Theorem 1). Thus, we have 

Theorem 5: Let A be a sharp, state-complete, irreducible bi-symmetric model having a 
strong conjugate and satisfying the filtering condition. The E(A)+ is homogeneous and 
self-dual. 

We also have 

Theorem 6: Let ^ be an image-closed, dagger-monoidal probabilistic theory, in which 
every system is bi-symmetric and state-complete. If A is sharp and satisfies the Corre- 
lation and Filtering conditions, then E(A)+ is homogeneous and self -dual. 

(Notice that here, as in Theorem 4b, image-closure can be dropped, if we are willing to 
concentrate on incompressible models.) 



32 



6 Conclusion and Speculations 



The foregoing results show that the Jordan structure of finite-dimensional QM emerges 
very naturally from a few relatively simple constraints having reasonably clear opera- 
tional or physical meanings. Or, better to say, follow from any of several different clus- 
ters, or packages, of such constraints. Two of these are given in Theorems 5 and 6. Some 
others: 

(1) Individual systems are bi-symmetric, state-closed, irreducible, and has a 
conjugate system with an iso-correlator. Every interior state can be reversibly 
prepared from the maximally mixed state. 

(2) Individual systems are sharp, state-closed, irreducible, fully symmetric, 
and satisfy both the strong correlation and the filtering condition. 

(3) Systems collectively form an image-closed category with a dagger-monoidal 
representation, and individually are sharp, bi-symmetric and satisfy the steer- 
ing condition. 

Obviously, though, there's much left to do. Regarding (3), for example, while existence 
of a symmetric monoidal structure is not usually viewed as problematic, the existence of 
a dagger cries out for further explanation. One would like to find a compelling physical 
or operational interpretation for such a structure. (One attractive, though at this point 
vague, idea is that a dagger corresponds to a global time-reversal symmetry.) 

To all of these examples, there is an aesthetic objection: there are too many moving parts. 
It is likely, however, that the apparatus can be simplified. For example, there is a sense 
in which both full symmetry and filtering are expressions of the same idea: that any 
classically allowed, reversible process acting on the probabilistic apparatus associated 
with a single test, should extend to an irreversible process acting on the entire system. In 
terms of a slogan: any classically reversible process corresponds to a physically reversible 
process. Finally, it would be very desirable to replace the image-closure condition with 
some kind of reduction theory, according to which all systems in simply are direct sums, 
in some suitable sense, of irreducible systems. At present, I do not see how to obtain such 
a theory by anything short of fiat, but this may simply reflect lack of sufficient effort, or 



I have made no real effort to establish in detail how the various conditions enumerated 
here depend on one another, so there is the possibility that, given some of them, others are 

^^Alternatively, one could hope to show that (perhaps in the presence of other constraints), homogeneity 
already implies irreducibility. This is true, for example, if the group G comprises all unit-preserving order- 
automorphisms in the connected component of the identity of Aut(E). 
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simply redundant. It is also perfectly conceivable that these conditions are stronger than 
necessary. For example, I haven't checked to see whether every simple Jordan model has 
a conjugate, or satisfies filtering. At a more fundental level, it remains an important open 
question whether there exist any reore-C *-algebaic dagger-symmetric monoidal categories 
of formally real Jordan algebras. 

I want to emphasize again that local tomography has played no role here. In a forthcom- 
ing paper [9] with Howard Barnum, it will be shown that if ^ is a dagger-HSD category of 
order-unit spaces with non- signaling, locally tomographic composites, and if 8 contains a 
model having the structure of a qubit, then it is a category of finite-dimensional complex 
matrix algebras. 

Acknowledgements I am indebted to David Feldman, Howard Barnum and Jochen Rau 
for helpful comments and questions regarding earlier versions of this paper. 
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